Dataset info
| Number of variables | 38 |
|---|---|
| Number of observations | 3837 |
| Missing cells | 478 (0.3%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 304.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 3 |
| Boolean | 18 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 1 |
| Rejected | 5 |
| Unsupported | 0 |
Warnings
| Warning | Type |
|---|---|
| budget has 497 (13.0%) zeros | Zeros |
| overseas-gross has 460 (12.0%) missing values | Missing |
| overseas-pct has 462 (12.0%) zeros | Zeros |
| revenues is highly correlated with overseas-gross (ρ = 0.9717238163) | Rejected |
| studio has a high cardinality: 203 distinct values | Warning |
| title has a high cardinality: 3801 distinct values | Warning |
| TV_Movie has constant value "0" | Rejected |
| Unnamed_0_x is highly correlated with Unnamed_0 (ρ = 0.9940708728) | Rejected |
| Unnamed_0_y is highly correlated with bo_year_rank (ρ = 1) | Rejected |
| worldwide-gross is highly correlated with revenues (ρ = 0.9906464716) | Rejected |
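
The percentages in the warnings above appear to be plain counts divided by the 3837 observations. A minimal sketch that re-derives them, assuming one-decimal rounding; the helper name `share` is hypothetical and not part of the profiling tool:

```python
# Re-derive the warning percentages from the counts reported above.
N_OBSERVATIONS = 3837  # "Number of observations" from the dataset info


def share(count: int, total: int = N_OBSERVATIONS) -> float:
    """Return count as a percentage of total, rounded to one decimal."""
    return round(100 * count / total, 1)


budget_zero_pct = share(497)        # zeros in budget
overseas_missing_pct = share(460)   # missing values in overseas-gross
overseas_pct_zero_pct = share(462)  # zeros in overseas-pct

print(budget_zero_pct, overseas_missing_pct, overseas_pct_zero_pct)
```

All three values round to the figures shown in the warnings, which suggests the denominators are the full row count rather than the non-missing count.
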
Action
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 2801 | 73.0% | |
| 1 | 1036 | 27.0% |
Adventure
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3125 | 81.4% | |
| 1 | 712 | 18.6% |
Animation
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3591 | 93.6% | |
| 1 | 246 | 6.4% |
bo_year_rank
Numeric
| Distinct count | 398 |
|---|---|
| Unique (%) | 10.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 106.3468856 |
|---|---|
| Minimum | 1 |
| Maximum | 443 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 34 |
| Median | 83 |
| Q3 | 155 |
| 95-th percentile | 294 |
| Maximum | 443 |
| Range | 442 |
| Interquartile range | 121 |
Descriptive statistics
| Standard deviation | 90.12067176 |
|---|---|
| Coef of variation | 0.847421824 |
| Kurtosis | 0.7948089266 |
| Mean | 106.3468856 |
| MAD | 71.73746538 |
| Skewness | 1.119633974 |
| Sum | 408053 |
| Variance | 8121.735478 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 2 | 31 | 0.8% | |
| 21 | 31 | 0.8% | |
| 4 | 31 | 0.8% | |
| 6 | 31 | 0.8% | |
| 12 | 31 | 0.8% | |
| 3 | 31 | 0.8% | |
| 5 | 31 | 0.8% | |
| 7 | 31 | 0.8% | |
| 8 | 30 | 0.8% | |
| 10 | 30 | 0.8% | |
| Other values (388) | 3529 | 92.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 30 | 0.8% | |
| 2 | 31 | 0.8% | |
| 3 | 31 | 0.8% | |
| 4 | 31 | 0.8% | |
| 5 | 31 | 0.8% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 443 | 1 | < 0.1% | |
| 438 | 1 | < 0.1% | |
| 436 | 1 | < 0.1% | |
| 435 | 1 | < 0.1% | |
| 433 | 1 | < 0.1% |
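
Several of the bo_year_rank statistics above can be derived from one another. The sketch below checks those relationships using only values copied from the tables; it assumes the conventional definitions (range = max − min, IQR = Q3 − Q1, coefficient of variation = standard deviation / mean, variance = standard deviation squared), which the report itself does not spell out:

```python
# Summary values copied from the bo_year_rank tables above.
mean = 106.3468856
std = 90.12067176
minimum, maximum = 1, 443
q1, q3 = 34, 155

value_range = maximum - minimum   # reported Range: 442
iqr = q3 - q1                     # reported Interquartile range: 121
coef_of_variation = std / mean    # reported Coef of variation: 0.847421824
variance = std ** 2               # reported Variance: 8121.735478

print(value_range, iqr, round(coef_of_variation, 6), round(variance, 3))
```

The same checks apply to the other numeric variables in this report.
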
budget
Numeric
| Distinct count | 383 |
|---|---|
| Unique (%) | 10.0% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 39998995.12 |
|---|---|
| Minimum | 0 |
| Maximum | 500000000 |
| Zeros (%) | 13.0% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7400000 |
| Median | 25000000 |
| Q3 | 55000000 |
| 95-th percentile | 150000000 |
| Maximum | 500000000 |
| Range | 500000000 |
| Interquartile range | 47600000 |
Descriptive statistics
| Standard deviation | 47430268.89 |
|---|---|
| Coef of variation | 1.185786511 |
| Kurtosis | 7.013132631 |
| Mean | 39998995.12 |
| MAD | 33984846.5 |
| Skewness | 2.192316187 |
| Sum | 1.534761443e+11 |
| Variance | 2.249630407e+15 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 0 | 497 | 13.0% | |
| 30000000 | 141 | 3.7% | |
| 20000000 | 136 | 3.5% | |
| 40000000 | 129 | 3.4% | |
| 25000000 | 120 | 3.1% | |
| 35000000 | 107 | 2.8% | |
| 50000000 | 106 | 2.8% | |
| 15000000 | 98 | 2.6% | |
| 60000000 | 92 | 2.4% | |
| 10000000 | 92 | 2.4% | |
| Other values (373) | 2319 | 60.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 497 | 13.0% | |
| 93 | 1 | < 0.1% | |
| 4000 | 1 | < 0.1% | |
| 7000 | 1 | < 0.1% | |
| 8000 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 500000000 | 1 | < 0.1% | |
| 380000000 | 1 | < 0.1% | |
| 356000000 | 1 | < 0.1% | |
| 300000000 | 2 | 0.1% | |
| 280000000 | 1 | < 0.1% |
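
With 13.0% of budgets equal to 0 and the next smallest value at 93, the zeros may be placeholders for unrecorded budgets. The report does not say so; this is an assumption. Under that assumption, the sketch below shows how much the zeros can shift a summary statistic, using the ten budgets listed in the report's "First rows" table:

```python
from statistics import median

# Budgets of the ten rows in the "First rows" table; 0 appears once.
budgets = [94000000, 55000000, 15000000, 12800000, 90000000,
           0, 140000000, 30000000, 72000000, 14000000]

# Assumption: a budget of 0 means "not recorded", so exclude it.
known = [b for b in budgets if b > 0]

median_with_zeros = median(budgets)  # zeros treated as real values
median_known_only = median(known)    # zeros treated as missing

print(median_with_zeros, median_known_only)
```

Whether to drop, impute, or keep the zeros depends on how the budget field was collected, which this report does not document.
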
Comedy
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 2413 | 62.9% | |
| 1 | 1424 | 37.1% |
Crime
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3259 | 84.9% | |
| 1 | 578 | 15.1% |
Documentary
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3741 | 97.5% | |
| 1 | 96 | 2.5% |
domestic-gross
Numeric
| Distinct count | 1746 |
|---|---|
| Unique (%) | 45.5% |
| Missing (%) | 0.4% |
| Missing (n) | 15 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 55993982.31 |
|---|---|
| Minimum | 400 |
| Maximum | 936700000 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 400 |
|---|---|
| 5-th percentile | 159150 |
| Q1 | 6700000 |
| Median | 31600000 |
| Q3 | 71950000 |
| 95-th percentile | 200795000 |
| Maximum | 936700000 |
| Range | 936699600 |
| Interquartile range | 65250000 |
Descriptive statistics
| Standard deviation | 77861124.8 |
|---|---|
| Coef of variation | 1.390526653 |
| Kurtosis | 18.69540107 |
| Mean | 55993982.31 |
| MAD | 50684872.85 |
| Skewness | 3.390172791 |
| Sum | 2.140090004e+11 |
| Variance | 6.062354755e+15 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 1100000 | 27 | 0.7% | |
| 1300000 | 19 | 0.5% | |
| 1200000 | 17 | 0.4% | |
| 1000000 | 15 | 0.4% | |
| 1400000 | 15 | 0.4% | |
| 2200000 | 14 | 0.4% | |
| 1600000 | 14 | 0.4% | |
| 1700000 | 14 | 0.4% | |
| 2300000 | 13 | 0.3% | |
| 3000000 | 13 | 0.3% | |
| Other values (1735) | 3661 | 95.4% | |
| (Missing) | 15 | 0.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 400 | 1 | < 0.1% | |
| 1000 | 1 | < 0.1% | |
| 1700 | 2 | 0.1% | |
| 1800 | 1 | < 0.1% | |
| 3900 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 936700000 | 1 | < 0.1% | |
| 858400000 | 1 | < 0.1% | |
| 749800000 | 1 | < 0.1% | |
| 700100000 | 1 | < 0.1% | |
| 678800000 | 1 | < 0.1% |
domestic-pct
Numeric
| Distinct count | 906 |
|---|---|
| Unique (%) | 23.6% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 56.95994266 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros (%) | 0.5% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11.98 |
| Q1 | 37.6 |
| Median | 53.4 |
| Q3 | 77.3 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range | 39.7 |
Descriptive statistics
| Standard deviation | 27.0933746 |
|---|---|
| Coef of variation | 0.475656634 |
| Kurtosis | -0.8149988889 |
| Mean | 56.95994266 |
| MAD | 22.38286128 |
| Skewness | 0.08226896555 |
| Sum | 218555.3 |
| Variance | 734.0509471 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 100 | 462 | 12.0% | |
| 0 | 18 | 0.5% | |
| 51.2 | 13 | 0.3% | |
| 55.8 | 12 | 0.3% | |
| 36.4 | 11 | 0.3% | |
| 45.2 | 11 | 0.3% | |
| 29.4 | 11 | 0.3% | |
| 45.7 | 11 | 0.3% | |
| 57.9 | 11 | 0.3% | |
| 43.5 | 11 | 0.3% | |
| Other values (896) | 3266 | 85.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 18 | 0.5% | |
| 0.1 | 6 | 0.2% | |
| 0.2 | 6 | 0.2% | |
| 0.3 | 3 | 0.1% | |
| 0.4 | 4 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100 | 462 | 12.0% | |
| 99.9 | 7 | 0.2% | |
| 99.8 | 4 | 0.1% | |
| 99.7 | 2 | 0.1% | |
| 99.6 | 1 | < 0.1% |
Drama
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 2010 | 52.4% | |
| 1 | 1827 | 47.6% |
Family
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3374 | 87.9% | |
| 1 | 463 | 12.1% |
Fantasy
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3434 | 89.5% | |
| 1 | 403 | 10.5% |
Film_Genre
Categorical
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | < 0.1% |
| Missing (n) | 1 |
| Value | Count | Frequency (%) | |
| Drama | 895 | 23.3% | |
| Comedy | 808 | 21.1% | |
| Action | 686 | 17.9% | |
| Adventure | 275 | 7.2% | |
| Horror | 196 | 5.1% | |
| Thriller | 177 | 4.6% | |
| Crime | 166 | 4.3% | |
| Animation | 120 | 3.1% | |
| Romance | 104 | 2.7% | |
| Fantasy | 88 | 2.3% | |
| Other values (8) | 321 | 8.4% |
| Max length | 15 |
|---|---|
| Mean length | 6.44357571 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
History
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3676 | 95.8% | |
| 1 | 161 | 4.2% |
Horror
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3477 | 90.6% | |
| 1 | 360 | 9.4% |
imdb_id
Categorical, Unique
First 5 values
| Value | Count | Frequency (%) | |
| tt0096734 | 1 | < 0.1% | |
| tt0096754 | 1 | < 0.1% | |
| tt0096794 | 1 | < 0.1% | |
| tt0096874 | 1 | < 0.1% | |
| tt0096895 | 1 | < 0.1% |
Last 5 values
| Value | Count | Frequency (%) | |
| tt9541602 | 1 | < 0.1% | |
| tt8772262 | 1 | < 0.1% | |
| tt8695030 | 1 | < 0.1% | |
| tt8663516 | 1 | < 0.1% | |
| tt8385474 | 1 | < 0.1% |
Music
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3718 | 96.9% | |
| 1 | 119 | 3.1% |
Mystery
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3496 | 91.1% | |
| 1 | 341 | 8.9% |
overseas-gross
Numeric
| Distinct count | 1703 |
|---|---|
| Unique (%) | 44.4% |
| Missing (%) | 12.0% |
| Missing (n) | 460 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 83057641.13 |
|---|---|
| Minimum | 100 |
| Maximum | 2029200000 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 390800 |
| Q1 | 8000000 |
| Median | 32800000 |
| Q3 | 91700000 |
| 95-th percentile | 349600000 |
| Maximum | 2029200000 |
| Range | 2029199900 |
| Interquartile range | 83700000 |
Descriptive statistics
| Standard deviation | 143810476.9 |
|---|---|
| Coef of variation | 1.731453903 |
| Kurtosis | 31.71050294 |
| Mean | 83057641.13 |
| MAD | 85492413.14 |
| Skewness | 4.381861616 |
| Sum | 2.804856541e+11 |
| Variance | 2.068145327e+16 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 1100000 | 15 | 0.4% | |
| 1200000 | 15 | 0.4% | |
| 1900000 | 14 | 0.4% | |
| 3700000 | 14 | 0.4% | |
| 1300000 | 13 | 0.3% | |
| 2800000 | 12 | 0.3% | |
| 2200000 | 12 | 0.3% | |
| 1400000 | 12 | 0.3% | |
| 4200000 | 11 | 0.3% | |
| 5400000 | 11 | 0.3% | |
| Other values (1692) | 3248 | 84.6% | |
| (Missing) | 460 | 12.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 100 | 1 | < 0.1% | |
| 900 | 1 | < 0.1% | |
| 1700 | 1 | < 0.1% | |
| 4500 | 1 | < 0.1% | |
| 5300 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2029200000 | 1 | < 0.1% | |
| 1937900000 | 1 | < 0.1% | |
| 1528100000 | 1 | < 0.1% | |
| 1369500000 | 1 | < 0.1% | |
| 1163000000 | 1 | < 0.1% |
overseas-pct
Numeric
| Distinct count | 906 |
|---|---|
| Unique (%) | 23.6% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 43.04005734 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros (%) | 12.0% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 22.7 |
| Median | 46.6 |
| Q3 | 62.4 |
| 95-th percentile | 88.02 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range | 39.7 |
Descriptive statistics
| Standard deviation | 27.0933746 |
|---|---|
| Coef of variation | 0.6294920656 |
| Kurtosis | -0.8149988889 |
| Mean | 43.04005734 |
| MAD | 22.38286128 |
| Skewness | -0.08226896555 |
| Sum | 165144.7 |
| Variance | 734.0509471 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 0 | 462 | 12.0% | |
| 100 | 18 | 0.5% | |
| 48.8 | 13 | 0.3% | |
| 44.2 | 12 | 0.3% | |
| 55.5 | 11 | 0.3% | |
| 56.5 | 11 | 0.3% | |
| 63.6 | 11 | 0.3% | |
| 70.6 | 11 | 0.3% | |
| 54.3 | 11 | 0.3% | |
| 54 | 11 | 0.3% | |
| Other values (896) | 3266 | 85.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 462 | 12.0% | |
| 0.1 | 7 | 0.2% | |
| 0.2 | 4 | 0.1% | |
| 0.3 | 2 | 0.1% | |
| 0.4 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100 | 18 | 0.5% | |
| 99.9 | 6 | 0.2% | |
| 99.8 | 6 | 0.2% | |
| 99.7 | 3 | 0.1% | |
| 99.6 | 4 | 0.1% |
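
The domestic-pct and overseas-pct summaries mirror each other: identical standard deviation, variance, and kurtosis, skewness of opposite sign, and 462 values of 100 in one column against 462 zeros in the other. That pattern is consistent with overseas-pct = 100 − domestic-pct, although the report does not state the relationship. A small sketch that checks it on the rows shown in the "First rows" table:

```python
# (domestic-pct, overseas-pct) pairs from the "First rows" table.
pct_pairs = [(37.8, 62.2), (48.7, 51.3), (36.5, 63.5), (10.5, 89.5),
             (24.2, 75.8), (4.1, 95.9), (46.7, 53.3), (38.7, 61.3),
             (64.7, 35.3), (63.6, 36.4)]

# Each pair should sum to 100; round to guard against float noise.
pairs_sum_to_100 = all(round(d + o, 1) == 100.0 for d, o in pct_pairs)

# The two reported means should be complements as well.
mean_total = 56.95994266 + 43.04005734

print(pairs_sum_to_100, round(mean_total, 6))
```

If the relationship holds for the whole dataset, one of the two columns carries no extra information and could be dropped before modelling.
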
popularity
Numeric
| Distinct count | 3496 |
|---|---|
| Unique (%) | 91.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 13.62128303 |
|---|---|
| Minimum | 0.6 |
| Maximum | 452.653 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.6 |
|---|---|
| 5-th percentile | 3.7568 |
| Q1 | 7.938 |
| Median | 11.515 |
| Q3 | 16.141 |
| 95-th percentile | 28.891 |
| Maximum | 452.653 |
| Range | 452.053 |
| Interquartile range | 8.203 |
Descriptive statistics
| Standard deviation | 13.32344393 |
|---|---|
| Coef of variation | 0.9781342842 |
| Kurtosis | 386.9207152 |
| Mean | 13.62128303 |
| MAD | 6.272453193 |
| Skewness | 14.7621721 |
| Sum | 52264.863 |
| Variance | 177.5141582 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 0.6 | 10 | 0.3% | |
| 1.4 | 4 | 0.1% | |
| 9.567 | 3 | 0.1% | |
| 15.907 | 3 | 0.1% | |
| 10.869 | 3 | 0.1% | |
| 13.92 | 3 | 0.1% | |
| 13.506 | 3 | 0.1% | |
| 11.923 | 3 | 0.1% | |
| 10.472 | 3 | 0.1% | |
| 12.067 | 3 | 0.1% | |
| Other values (3486) | 3799 | 99.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.6 | 10 | 0.3% | |
| 0.617 | 1 | < 0.1% | |
| 0.679 | 1 | < 0.1% | |
| 0.745 | 1 | < 0.1% | |
| 0.746 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 452.653 | 1 | < 0.1% | |
| 270.012 | 1 | < 0.1% | |
| 262.515 | 1 | < 0.1% | |
| 151.174 | 1 | < 0.1% | |
| 145.496 | 1 | < 0.1% |
revenues
Highly correlated
This variable is highly correlated with overseas-gross and should be ignored for analysis
| Correlation | 0.9717238163 |
|---|---|
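
The rejection above is based on a correlation coefficient. The sketch below computes a Pearson coefficient for overseas-gross and revenues on the ten rows of the "First rows" table only, so its result will differ from the full-dataset value of 0.9717238163. The function name `pearson` and the 0.9 cut-off are illustrative assumptions; the report does not state which threshold the tool applied:

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# overseas-gross and revenues from the "First rows" table.
overseas_gross = [559500000, 347700000, 226200000, 35800000, 200100000,
                  9300000, 348900000, 110900000, 34200000, 58000000]
revenues = [940335536, 677945399, 356296601, 40031879, 263920180,
            9726954, 655011224, 180949045, 96889998, 159157447]

THRESHOLD = 0.9  # hypothetical cut-off, not taken from the report
rho = pearson(overseas_gross, revenues)
reject_revenues = abs(rho) > THRESHOLD

print(round(rho, 4), reject_revenues)
```

A high correlation marks a column as redundant for many models, but revenues may still be the quantity of interest, so "should be ignored" is best read as a default suggestion rather than a rule.
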
Romance
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3098 | 80.7% | |
| 1 | 739 | 19.3% |
runtime
Numeric
| Distinct count | 131 |
|---|---|
| Unique (%) | 3.4% |
| Missing (%) | < 0.1% |
| Missing (n) | 1 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 110.0625652 |
|---|---|
| Minimum | 27 |
| Maximum | 338 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 27 |
|---|---|
| 5-th percentile | 86 |
| Q1 | 96 |
| Median | 106 |
| Q3 | 121 |
| 95-th percentile | 147 |
| Maximum | 338 |
| Range | 311 |
| Interquartile range | 25 |
Descriptive statistics
| Standard deviation | 20.25014999 |
|---|---|
| Coef of variation | 0.1839876252 |
| Kurtosis | 5.863595766 |
| Mean | 110.0625652 |
| MAD | 15.41567891 |
| Skewness | 1.315762642 |
| Sum | 422200 |
| Variance | 410.0685748 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 100 | 119 | 3.1% | |
| 105 | 102 | 2.7% | |
| 93 | 97 | 2.5% | |
| 97 | 96 | 2.5% | |
| 96 | 96 | 2.5% | |
| 101 | 95 | 2.5% | |
| 95 | 92 | 2.4% | |
| 106 | 90 | 2.3% | |
| 98 | 87 | 2.3% | |
| 91 | 87 | 2.3% | |
| Other values (120) | 2875 | 74.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 27 | 1 | < 0.1% | |
| 37 | 1 | < 0.1% | |
| 38 | 1 | < 0.1% | |
| 39 | 2 | 0.1% | |
| 41 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 338 | 1 | < 0.1% | |
| 216 | 1 | < 0.1% | |
| 214 | 1 | < 0.1% | |
| 213 | 1 | < 0.1% | |
| 201 | 1 | < 0.1% |
Science_Fiction
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3415 | 89.0% | |
| 1 | 422 | 11.0% |
studio
Categorical
| Distinct count | 203 |
|---|---|
| Unique (%) | 5.3% |
| Missing (%) | < 0.1% |
| Missing (n) | 1 |
| Value | Count | Frequency (%) | |
| Uni. | 371 | 9.7% | |
| WB | 356 | 9.3% | |
| Fox | 339 | 8.8% | |
| BV | 279 | 7.3% | |
| Sony | 255 | 6.6% | |
| Par. | 230 | 6.0% | |
| LGF | 105 | 2.7% | |
| NL | 103 | 2.7% | |
| FoxS | 103 | 2.7% | |
| Focus | 90 | 2.3% | |
| Other values (192) | 1605 | 41.8% |
| Max length | 11 |
|---|---|
| Mean length | 3.478759447 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
Thriller
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 2792 | 72.8% | |
| 1 | 1045 | 27.2% |
title
Categorical
| Distinct count | 3801 |
|---|---|
| Unique (%) | 99.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| Unknown | 2 | 0.1% | |
| Life | 2 | 0.1% | |
| Raavan | 2 | 0.1% | |
| Fantastic Four | 2 | 0.1% | |
| Kabali | 2 | 0.1% | |
| The Lion King | 2 | 0.1% | |
| Point Break | 2 | 0.1% | |
| Aladdin | 2 | 0.1% | |
| The Mummy | 2 | 0.1% | |
| Frozen | 2 | 0.1% | |
| Other values (3791) | 3817 | 99.5% |
| Max length | 82 |
|---|---|
| Mean length | 14.91842585 |
| Min length | 1 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
TV_Movie
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0 |
|---|---|
Unnamed_0
Numeric
| Distinct count | 3837 |
|---|---|
| Unique (%) | 100.0% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1918 |
|---|---|
| Minimum | 0 |
| Maximum | 3836 |
| Zeros (%) | < 0.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 191.8 |
| Q1 | 959 |
| Median | 1918 |
| Q3 | 2877 |
| 95-th percentile | 3644.2 |
| Maximum | 3836 |
| Range | 3836 |
| Interquartile range | 1918 |
Descriptive statistics
| Standard deviation | 1107.79082 |
|---|---|
| Coef of variation | 0.5775760269 |
| Kurtosis | -1.2 |
| Mean | 1918 |
| MAD | 959.2499348 |
| Skewness | 0 |
| Sum | 7359366 |
| Variance | 1227200.5 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 2676 | 1 | < 0.1% | |
| 2700 | 1 | < 0.1% | |
| 649 | 1 | < 0.1% | |
| 2696 | 1 | < 0.1% | |
| 645 | 1 | < 0.1% | |
| 2692 | 1 | < 0.1% | |
| 641 | 1 | < 0.1% | |
| 2688 | 1 | < 0.1% | |
| 637 | 1 | < 0.1% | |
| Other values (3827) | 3827 | 99.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 3836 | 1 | < 0.1% | |
| 3835 | 1 | < 0.1% | |
| 3834 | 1 | < 0.1% | |
| 3833 | 1 | < 0.1% | |
| 3832 | 1 | < 0.1% |
Unnamed_0_x
Highly correlated
This variable is highly correlated with Unnamed_0 and should be ignored for analysis
| Correlation | 0.9940708728 |
|---|---|
Unnamed_0_y
Highly correlated
This variable is highly correlated with bo_year_rank and should be ignored for analysis
| Correlation | 1 |
|---|---|
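
A correlation of exactly 1 suggests that Unnamed_0_y is a linear function of bo_year_rank. In the rows displayed in the "First rows" and "Last rows" tables the index equals the rank minus one, which would fit a zero-based row index carried over from a ranking file. That origin is an assumption, and the check below covers only those 20 displayed rows:

```python
# (bo_year_rank, Unnamed_0_y) pairs from the "First rows" and "Last rows" tables.
pairs = [(2, 1), (2, 1), (9, 8), (88, 87), (9, 8), (169, 168), (4, 3),
         (26, 25), (56, 55), (13, 12), (22, 21), (327, 326), (147, 146),
         (91, 90), (32, 31), (53, 52), (104, 103), (57, 56), (20, 19),
         (183, 182)]

# Collect the distinct differences between rank and index.
offsets = {rank - idx for rank, idx in pairs}

print(offsets)
```

A single offset across all sampled rows supports treating Unnamed_0_y as a leftover index column rather than a feature.
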
vote_average
Numeric
| Distinct count | 60 |
|---|---|
| Unique (%) | 1.6% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 6.37211363 |
|---|---|
| Minimum | 0 |
| Maximum | 10 |
| Zeros (%) | 0.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5.9 |
| Median | 6.4 |
| Q3 | 7 |
| 95-th percentile | 7.7 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range | 1.1 |
Descriptive statistics
| Standard deviation | 0.8407405492 |
|---|---|
| Coef of variation | 0.1319406084 |
| Kurtosis | 2.08536162 |
| Mean | 6.37211363 |
| MAD | 0.6569979329 |
| Skewness | -0.4840546078 |
| Sum | 24449.8 |
| Variance | 0.7068446711 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 6.2 | 213 | 5.6% | |
| 6.3 | 190 | 5.0% | |
| 6.1 | 188 | 4.9% | |
| 6.6 | 183 | 4.8% | |
| 6.4 | 183 | 4.8% | |
| 6 | 169 | 4.4% | |
| 5.9 | 169 | 4.4% | |
| 6.5 | 167 | 4.4% | |
| 6.8 | 163 | 4.2% | |
| 6.7 | 161 | 4.2% | |
| Other values (50) | 2051 | 53.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.1% | |
| 2.5 | 2 | 0.1% | |
| 2.7 | 1 | < 0.1% | |
| 2.9 | 1 | < 0.1% | |
| 3 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 10 | 1 | < 0.1% | |
| 8.6 | 1 | < 0.1% | |
| 8.5 | 2 | 0.1% | |
| 8.4 | 9 | 0.2% | |
| 8.3 | 14 | 0.4% |
War
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3729 | 97.2% | |
| 1 | 108 | 2.8% |
Western
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Value | Count | Frequency (%) | |
| 0 | 3799 | 99.0% | |
| 1 | 38 | 1.0% |
worldwide-gross
Highly correlated
This variable is highly correlated with revenues and should be ignored for analysis
| Correlation | 0.9906464716 |
|---|---|
year
Numeric
| Distinct count | 31 |
|---|---|
| Unique (%) | 0.8% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2008.022935 |
|---|---|
| Minimum | 1989 |
| Maximum | 2019 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1989 |
|---|---|
| 5-th percentile | 1995 |
| Q1 | 2003 |
| Median | 2009 |
| Q3 | 2014 |
| 95-th percentile | 2018 |
| Maximum | 2019 |
| Range | 30 |
| Interquartile range | 11 |
Descriptive statistics
| Standard deviation | 6.992025507 |
|---|---|
| Coef of variation | 0.003482044645 |
| Kurtosis | -0.3584544594 |
| Mean | 2008.022935 |
| MAD | 5.761877971 |
| Skewness | -0.5325710856 |
| Sum | 7704784 |
| Variance | 48.88842069 |
| Memory size | 30.1 KiB |
| Value | Count | Frequency (%) | |
| 2016 | 221 | 5.8% | |
| 2015 | 210 | 5.5% | |
| 2011 | 205 | 5.3% | |
| 2008 | 197 | 5.1% | |
| 2006 | 197 | 5.1% | |
| 2014 | 196 | 5.1% | |
| 2013 | 190 | 5.0% | |
| 2012 | 181 | 4.7% | |
| 2017 | 180 | 4.7% | |
| 2009 | 179 | 4.7% | |
| Other values (21) | 1881 | 49.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1989 | 29 | 0.8% | |
| 1990 | 26 | 0.7% | |
| 1991 | 23 | 0.6% | |
| 1992 | 32 | 0.8% | |
| 1993 | 30 | 0.8% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2019 | 67 | 1.7% | |
| 2018 | 151 | 3.9% | |
| 2017 | 180 | 4.7% | |
| 2016 | 221 | 5.8% | |
| 2015 | 210 | 5.5% |
First rows
| | Action | Adventure | Animation | bo_year_rank | budget | Comedy | Crime | Documentary | domestic-gross | domestic-pct | Drama | Family | Fantasy | Film_Genre | History | Horror | imdb_id | Music | Mystery | overseas-gross | overseas-pct | popularity | revenues | Romance | runtime | Science_Fiction | studio | Thriller | title | TV_Movie | Unnamed_0 | Unnamed_0_x | Unnamed_0_y | vote_average | War | Western | worldwide-gross | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | 1 | 2 | 94000000 | 0 | 0 | 0 | 339700000.0 | 37.8 | 0 | 1 | 0 | Animation | 0 | 0 | tt0266543 | 0 | 0 | 559500000.0 | 62.2 | 34.178 | 940335536 | 0 | 100.0 | 0 | BV | 0 | Finding Nemo | 0 | 0 | 4 | 1 | 7.8 | 0 | 0 | 899200000.0 | 2003.0 |
| 1 | 0 | 0 | 0 | 2 | 55000000 | 1 | 0 | 0 | 329700000.0 | 48.7 | 1 | 0 | 0 | Comedy | 0 | 0 | tt0109830 | 0 | 0 | 347700000.0 | 51.3 | 37.204 | 677945399 | 1 | 142.0 | 0 | Par. | 0 | Forrest Gump | 0 | 1 | 5 | 1 | 8.4 | 0 | 0 | 677400000.0 | 1994.0 |
| 2 | 0 | 0 | 0 | 9 | 15000000 | 0 | 0 | 0 | 130100000.0 | 36.5 | 1 | 0 | 0 | Drama | 0 | 0 | tt0169547 | 0 | 0 | 226200000.0 | 63.5 | 24.504 | 356296601 | 0 | 122.0 | 0 | DW | 0 | American Beauty | 0 | 2 | 6 | 8 | 8.0 | 0 | 0 | 356300000.0 | 1999.0 |
| 3 | 0 | 0 | 0 | 88 | 12800000 | 0 | 1 | 0 | 4200000.0 | 10.5 | 1 | 0 | 0 | Drama | 0 | 0 | tt0168629 | 1 | 0 | 35800000.0 | 89.5 | 13.044 | 40031879 | 0 | 141.0 | 0 | FL | 0 | Dancer in the Dark | 0 | 3 | 8 | 87 | 7.9 | 0 | 0 | 40000000.0 | 2000.0 |
| 4 | 1 | 1 | 0 | 9 | 90000000 | 0 | 0 | 0 | 63800000.0 | 24.2 | 0 | 0 | 1 | Adventure | 0 | 0 | tt0119116 | 0 | 0 | 200100000.0 | 75.8 | 34.969 | 263920180 | 0 | 126.0 | 1 | Sony | 1 | The Fifth Element | 0 | 4 | 9 | 8 | 7.4 | 0 | 0 | 263900000.0 | 1997.0 |
| 5 | 0 | 0 | 0 | 169 | 0 | 0 | 0 | 0 | 401000.0 | 4.1 | 1 | 0 | 0 | Drama | 0 | 0 | tt0314412 | 0 | 0 | 9300000.0 | 95.9 | 5.300 | 9726954 | 1 | 106.0 | 0 | SPC | 0 | My Life Without Me | 0 | 5 | 11 | 168 | 6.3 | 0 | 0 | 9700000.0 | 2003.0 |
| 6 | 1 | 1 | 0 | 4 | 140000000 | 0 | 0 | 0 | 305400000.0 | 46.7 | 0 | 0 | 1 | Adventure | 0 | 0 | tt0325980 | 0 | 0 | 348900000.0 | 53.3 | 41.679 | 655011224 | 0 | 143.0 | 0 | BV | 0 | Pirates of the Caribbean The Curse of the Blac... | 0 | 6 | 12 | 3 | 7.7 | 0 | 0 | 654300000.0 | 2003.0 |
| 7 | 1 | 0 | 0 | 26 | 30000000 | 0 | 1 | 0 | 70100000.0 | 38.7 | 0 | 0 | 0 | Action | 0 | 0 | tt0266697 | 0 | 0 | 110900000.0 | 61.3 | 25.952 | 180949045 | 0 | 111.0 | 0 | Mira. | 0 | Kill Bill Vol. 1 | 0 | 7 | 13 | 25 | 7.9 | 0 | 0 | 180900000.0 | 2003.0 |
| 8 | 0 | 0 | 0 | 56 | 72000000 | 0 | 0 | 0 | 62700000.0 | 64.7 | 1 | 0 | 0 | Drama | 0 | 0 | tt0418763 | 0 | 0 | 34200000.0 | 35.3 | 14.157 | 96889998 | 0 | 123.0 | 0 | Uni. | 0 | Jarhead | 0 | 8 | 14 | 55 | 6.5 | 1 | 0 | 96900000.0 | 2005.0 |
| 9 | 0 | 0 | 0 | 13 | 14000000 | 0 | 0 | 0 | 101200000.0 | 63.6 | 0 | 0 | 0 | Western | 0 | 0 | tt0105695 | 0 | 0 | 58000000.0 | 36.4 | 21.759 | 159157447 | 0 | 131.0 | 0 | WB | 0 | Unforgiven | 0 | 9 | 18 | 12 | 7.9 | 0 | 1 | 159200000.0 | 1992.0 |
Last rows
| | Action | Adventure | Animation | bo_year_rank | budget | Comedy | Crime | Documentary | domestic-gross | domestic-pct | Drama | Family | Fantasy | Film_Genre | History | Horror | imdb_id | Music | Mystery | overseas-gross | overseas-pct | popularity | revenues | Romance | runtime | Science_Fiction | studio | Thriller | title | TV_Movie | Unnamed_0 | Unnamed_0_x | Unnamed_0_y | vote_average | War | Western | worldwide-gross | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3827 | 0 | 0 | 0 | 22 | 75000000 | 1 | 0 | 0 | 120600000.0 | 30.5 | 0 | 0 | 0 | Comedy | 0 | 0 | tt6911608 | 0 | 0 | 274400000.0 | 69.5 | 12.908 | 167225525 | 1 | 113.0 | 0 | Uni. | 0 | Mamma Mia Here We Go Again | 0 | 3827 | 9686 | 21 | 7.2 | 0 | 0 | 395000000.0 | 2018.0 |
| 3828 | 0 | 0 | 0 | 327 | 1089360 | 0 | 0 | 0 | 53200.0 | 43.6 | 1 | 0 | 0 | Drama | 0 | 0 | tt1337051 | 0 | 0 | 68800.0 | 56.4 | 1.809 | 48298 | 0 | 115.0 | 0 | IFC | 0 | Police Adjective | 0 | 3828 | 9694 | 326 | 6.6 | 0 | 0 | 122000.0 | 2009.0 |
| 3829 | 0 | 0 | 0 | 147 | 0 | 0 | 0 | 0 | 12600000.0 | 54.0 | 1 | 0 | 0 | Drama | 0 | 0 | tt0796307 | 0 | 0 | 10700000.0 | 46.0 | 5.687 | 23311391 | 0 | 106.0 | 0 | Wein. | 0 | Under the Same Moon | 0 | 3829 | 9704 | 146 | 7.3 | 0 | 0 | 23300000.0 | 2008.0 |
| 3830 | 1 | 0 | 0 | 91 | 25000000 | 0 | 0 | 0 | 35400000.0 | 65.8 | 0 | 0 | 0 | Action | 0 | 0 | tt6850820 | 0 | 0 | 18400000.0 | 34.2 | 16.008 | 48818723 | 0 | 102.0 | 0 | STX | 1 | Peppermint | 0 | 3830 | 9705 | 90 | 6.5 | 0 | 0 | 53800000.0 | 2018.0 |
| 3831 | 0 | 1 | 0 | 32 | 95000000 | 0 | 0 | 0 | 88800000.0 | 39.2 | 0 | 1 | 1 | Adventure | 0 | 0 | tt0814255 | 0 | 0 | 137700000.0 | 60.8 | 25.447 | 226497209 | 0 | 119.0 | 0 | Fox | 0 | Percy Jackson The Olympians The Lightning Thief | 0 | 3831 | 9710 | 31 | 6.1 | 0 | 0 | 226500000.0 | 2010.0 |
| 3832 | 0 | 0 | 0 | 53 | 48000000 | 1 | 0 | 0 | 67400000.0 | 55.9 | 1 | 1 | 0 | Comedy | 0 | 0 | tt7401588 | 0 | 0 | 53200000.0 | 44.1 | 18.839 | 14700000 | 0 | 118.0 | 0 | Par. | 0 | Instant Family | 0 | 3832 | 9711 | 52 | 7.6 | 0 | 0 | 120600000.0 | 2018.0 |
| 3833 | 0 | 0 | 0 | 104 | 16800000 | 1 | 0 | 0 | 10500000.0 | 20.7 | 1 | 0 | 0 | Comedy | 0 | 0 | tt2870756 | 0 | 0 | 40500000.0 | 79.3 | 9.575 | 51029361 | 1 | 97.0 | 0 | SPC | 0 | Magic in the Moonlight | 0 | 3833 | 9715 | 103 | 6.5 | 0 | 0 | 51000000.0 | 2014.0 |
| 3834 | 0 | 0 | 0 | 57 | 0 | 0 | 0 | 0 | 35400000.0 | 96.8 | 1 | 0 | 0 | Thriller | 0 | 1 | tt6722030 | 0 | 1 | 1200000.0 | 3.2 | 17.811 | 25724305 | 0 | 102.0 | 0 | SGem | 1 | The Intruder | 0 | 3834 | 9717 | 56 | 6.2 | 0 | 0 | 36600000.0 | 2019.0 |
| 3835 | 0 | 0 | 0 | 20 | 20000000 | 0 | 0 | 0 | 175000000.0 | 68.6 | 0 | 0 | 0 | Thriller | 0 | 1 | tt6857112 | 0 | 0 | 80100000.0 | 31.4 | 37.471 | 254664460 | 0 | 116.0 | 0 | Uni. | 1 | Us | 0 | 3835 | 9720 | 19 | 7.0 | 0 | 0 | 255100000.0 | 2019.0 |
| 3836 | 0 | 0 | 0 | 183 | 0 | 1 | 0 | 0 | 2100000.0 | 56.1 | 1 | 0 | 0 | Comedy | 0 | 0 | tt5884230 | 0 | 0 | 1700000.0 | 43.9 | 7.574 | 2400000 | 0 | 101.0 | 0 | Annapurna | 0 | Brads Status | 0 | 3836 | 9724 | 182 | 6.1 | 0 | 0 | 3800000.0 | 2017.0 |